AITopics | gradient boosted decision tree

SketchBoost: Fast Gradient Boosted Decision Tree for Multioutput Problems

Neural Information Processing SystemsDec-24-2025, 21:52:56 GMT

Gradient Boosted Decision Tree (GBDT) is a widely-used machine learning algorithm that has been shown to achieve state-of-the-art results on many standard data science problems. We are interested in its application to multioutput problems when the output is highly multidimensional. Although there are highly effective GBDT implementations, their scalability to such problems is still unsatisfactory. In this paper, we propose novel methods aiming to accelerate the training process of GBDT in the multioutput scenario. The idea behind these methods lies in the approximate computation of a scoring function used to find the best split of decision trees. These methods are implemented in SketchBoost, which itself is integrated into our easily customizable Python-based GPU implementation of GBDT called Py-Boost. Our numerical study demonstrates that SketchBoost speeds up the training process of GBDT by up to over 40 times while achieving comparable or even better performance.

fast gradient boosted decision tree, gradient boosted decision tree, sketchboost, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

5b84864ff8474fd742c66f219b2eaac1-Supplemental-Conference.pdf

Neural Information Processing SystemsNov-14-2025, 10:06:46 GMT

black-box model, dataset, interpretable model, (11 more...)

Neural Information Processing Systems

Country: North America > United States > Florida > Broward County (0.04)

Genre: Research Report (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

5b84864ff8474fd742c66f219b2eaac1-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-15-2025, 02:19:33 GMT

black-box model, dataset, interpretable model, (11 more...)

Neural Information Processing Systems

Country: North America > United States > Florida > Broward County (0.04)

Genre: Research Report (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

SketchBoost: Fast Gradient Boosted Decision Tree for Multioutput Problems

Neural Information Processing SystemsJan-18-2025, 07:38:20 GMT

Gradient Boosted Decision Tree (GBDT) is a widely-used machine learning algorithm that has been shown to achieve state-of-the-art results on many standard data science problems. We are interested in its application to multioutput problems when the output is highly multidimensional. Although there are highly effective GBDT implementations, their scalability to such problems is still unsatisfactory. In this paper, we propose novel methods aiming to accelerate the training process of GBDT in the multioutput scenario. The idea behind these methods lies in the approximate computation of a scoring function used to find the best split of decision trees.

fast gradient boosted decision tree, gradient boosted decision tree, sketchboost, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.66)

Add feedback

Predicting Subway Passenger Flows under Incident Situation with Causality

Huang, Xiannan, Qiu, Shuhan, Yuan, Quan, Yang, Chao

arXiv.org Artificial IntelligenceDec-9-2024

In the context of rail transit operations, real-time passenger flow prediction is essential; however, most models primarily focus on normal conditions, with limited research addressing incident situations. There are several intrinsic challenges associated with prediction during incidents, such as a lack of interpretability and data scarcity. To address these challenges, we propose a two-stage method that separates predictions under normal conditions and the causal effects of incidents. First, a normal prediction model is trained using data from normal situations. Next, the synthetic control method is employed to identify the causal effects of incidents, combined with placebo tests to determine significant levels of these effects. The significant effects are then utilized to train a causal effect prediction model, which can forecast the impact of incidents based on features of the incidents and passenger flows. During the prediction phase, the results from both the normal situation model and the causal effect prediction model are integrated to generate final passenger flow predictions during incidents. Our approach is validated using real-world data, demonstrating improved accuracy. Furthermore, the two-stage methodology enhances interpretability. By analyzing the causal effect prediction model, we can identify key influencing factors related to the effects of incidents and gain insights into their underlying mechanisms. Our work can assist subway system managers in estimating passenger flow affected by incidents and enable them to take proactive measures. Additionally, it can deepen researchers' understanding of the impact of incidents on subway passenger flows.

artificial intelligence, incident, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2412.06871

Country:

Asia > China > Shanghai > Shanghai (0.05)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > California (0.04)
(6 more...)

Genre: Research Report > Experimental Study (0.46)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Rail (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

High-Order Optimization of Gradient Boosted Decision Trees

Pachebat, Jean, Ivanov, Sergei

arXiv.org Artificial IntelligenceNov-21-2022

Gradient Boosted Decision Trees (GBDTs) are dominant machine learning algorithms for modeling discrete or tabular data. Unlike neural networks with millions of trainable parameters, GBDTs optimize loss function in an additive manner and have a single trainable parameter per leaf, which makes it easy to apply high-order optimization of the loss function. In this paper, we introduce high-order optimization for GBDTs based on numerical optimization theory which allows us to construct trees based on high-order derivatives of a given loss function. In the experiments, we show that high-order optimization has faster per-iteration convergence that leads to reduced running time. Our solution can be easily parallelized and run on GPUs with little overhead on the code. Finally, we discuss future potential improvements such as automatic differentiation of arbitrary loss function and combination of GBDTs with neural networks.

artificial intelligence, loss function, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2211.11367

Country: Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.86)

Add feedback

Bank Customer Churn Prediction Using Machine Learning

#artificialintelligenceOct-3-2022, 08:06:07 GMT

This article was published as a part of the Data Science Blogathon. Customer Churn prediction means knowing which customers are likely to leave or unsubscribe from your service. For many companies, this is an important prediction. This is because acquiring new customers often costs more than retaining existing ones. Once you've identified customers at risk of churn, you need to know exactly what marketing efforts you should make with each customer to maximize their likelihood of staying.

algorithm, customer, decision tree, (13 more...)

#artificialintelligence

Industry: Banking & Finance (0.42)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

Gradient Boosted Decision Trees explained with a real-life example and some Python code

#artificialintelligenceAug-19-2021, 06:10:37 GMT

Gradient Boosting algorithms tackle one of the biggest problems in Machine Learning: bias. Decision Trees is a simple and flexible algorithm. An underfit Decision Tree has low depth, meaning it splits the dataset only a few of times in an attempt to separate the data. Because it doesn't separate the dataset into more and more distinct observations, it can't capture the true patterns in it. When it comes to tree-based algorithms Random Forests was revolutionary, because it used Bagging to reduce the overall variance of the model with an ensemble of random trees.

algorithm, decision tree, weak learner, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.92)

Add feedback

Finding Influential Training Samples for Gradient Boosted Decision Trees

Sharchilev, Boris, Ustinovsky, Yury, Serdyukov, Pavel, de Rijke, Maarten

arXiv.org Machine LearningMar-12-2018

We address the problem of finding influential training samples for a particular case of tree ensemble-based models, e.g., Random Forest (RF) or Gradient Boosted Decision Trees (GBDT). A natural way of formalizing this problem is studying how the model's predictions change upon leave-one-out retraining, leaving out each individual training sample. Recent work has shown that, for parametric models, this analysis can be conducted in a computationally efficient way. We propose several ways of extending this framework to non-parametric GBDT ensembles under the assumption that tree structures remain fixed. Furthermore, we introduce a general scheme of obtaining further approximations to our method that balance the trade-off between performance and computational complexity. We evaluate our approaches on various experimental setups and use-case scenarios and demonstrate both the quality of our approach to finding influential training samples in comparison to the baselines and its computational efficiency.

artificial intelligence, leaf value, machine learning, (16 more...)

arXiv.org Machine Learning

1802.0664

Country: